Probability laws related to the Jacobi theta and 
Riemann zeta functions, and Brownian excursions 

Philippe Biane* Jim Pitman ^ and Marc Yor* 
February 8, 2008 



Abstract 

This paper reviews known results which connect Riemann's integral represen- 
tations of his zeta function, involving Jacobi's theta function and its derivatives, to 
some particular probability laws governing sums of independent exponential vari- 
ables. These laws are related to one-dimensional Brownian motion and to higher 
dimensional Bessel processes. We present some characterizations of these proba- 
bility laws, and some approximations of Riemann's zeta function which are related 
to these laws. 
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1 Introduction 



In his fundamental paper [|63|| , Riemann showed that the Riemann zeta function, initially 
defined by the series 



oo 



n=l 

admits a meromorphic continuation to the entire complex plane, with only a simple pole 
at 1, and that the function 

fa) := fas - l)*-Prfa)C{8) (Rs>l) (2) 

is the restriction to (dts > 1) of a unique entire analytic function £, which satisfies the 
functional equation 

t( S ) = ai-s) (3) 

for all complex s. These basic properties of ( and £ follow from a representation of 2£ 
as the Mellin transform of a function involving derivatives of Jacobi's theta function. 
This function turns out to be the density of a probability distribution on the real line, 
which has deep and intriguing connections with the theory of Brownian motion. This 
distribution first appears in the probabilistic literature in the 1950 's in the work of Feller 
24fl , Gnedenko [p6fl , and Takacs |7lH , who derived it as the asymptotic distribution as 



n — > oo of the range of a simple one- dimensional random walk conditioned to return to 
its origin after 2n steps, and found formula (|5|) below for s — 1, 2, • • •. Combined with 
the approximation of random walks by Brownian motion, justified by Donsker's theorem 
||, [20], |62| , the random walk asymptotics imply that if 



Y := \ - ( max b u — min b u } 

V n \0<U<1 0<m<1 J 



(4) 



where (b u , < u < 1) is the standard Brownian bridge derived by conditioning a one- 
dimensional Brownian motion (B u , < u < 1) on B = B\ = 0, then 

E(Y S ) = 2£( a ) (s G C). (5) 

where E is the expectation operator. Many other constructions of random variables with 
the same distribution as Y have since been discovered, involving functionals of the path 
of a Brownian motion or Brownian bridge in R d for d — 1, 2, 3 or 4. 

Our main purpose in this paper is to review this circle of ideas, with emphasis on the 
probabilistic interpretations such as (H)-(H) of various functions which play an important 
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role in analytic number theory. For the most part this is a survey of known results, but 
the result of Section |5] may be new. 

Section |2| reviews the classical analysis underlying (|5]), and offers different analytic 
characterizations of the probability distribution of Y. Section R| presents various formulae 
related to the distributions of the random variables E/j and E^ defined by 

2 00 r 2 00 r 

Eh := ^ X) ~1? and S * := ~X? "TTi ' ( 6 ) 

n=l n=l v 

for independent random variables I\ n with the gamma(h) density 

P(T h , n e dx)/dx = r(/i)- 1 x h - 1 e - x (t > 0). (7) 

Our motivation to study these laws stems from their close connection to the classical 
functions of analytic number theory, and their repeated appearances in the study of 
Brownian motion, which we recall in Section f|. For example, to make the connection 
with the beginning of this introduction, one has 

S 2 i -Y 2 (8) 

7T 

where = means equality in distribution. As we discuss in Section ^, Brownian paths 
possess a number distributional symmetries, which explain some of the remarkable coin- 
cidences in distribution implied by the repeated appearances of the laws of E ft and E^ 
for various h. Section |^ shows how one of the probabilistic results of Section |3] leads 
us to an approximation of the zeta function, valid in the entire complex plane, which is 
similar to an approximation obtained by Sondow . We conclude in Section ^| with 



some consideration of the Hurwitz zeta function and Dirichlet L-functions, and some 
references to other work relating the Riemann zeta function to probability theory. 

2 Probabilistic interpretations of some classical an- 
alytic formulae 

2.1 Some classical analysis 

Let us start with Jacobi's theta function identity 

1 00 00 

7= e- (n+x)2/t = cos(2wnr) e~ n ^ H {x G R, t > 0) (9) 



TVt 

n=— 00 
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which is a well known instance of the Poisson summation formula 0. This identity 
equates two different expressions for pt/2(0, %), where pt(0, x) is the fundamental solution 
of the heat equation on a circle identified with [0, 1], with initial condition 5q, the delta 
function at zero. In probabilistic terms, pt(0,x) is the probability density at x G [0,1] 
of the position of a Brownian motion on the circle started at at time and run for 
time t. The left hand expression is obtained by wrapping the Gaussian solution on the 
line, while the right hand expression is obtained by Fourier analysis. In particular, @ 
for x = can be written 

y/ie(t) = 9{r 1 ) (t>o) (io) 

where 6 is the Jacobi theta function 

oo 

6(t) := Yl exp(-n 2 7rf) (t > 0). (11) 

n=— oo 

For the function £ defined by (|2|), Riemann obtained the integral representation 

= J" t i-\0(t) - l)dt (»(*)> 1) (12) 

by switching the order of summation and integration and using 

POO 

r(s) = / x"- 1 e- x dx (9ta>0). (13) 
Jo 

He then deduced his functional equation £(s) = £(1 — s) from (0) and Jacobi's functional 
equation (pTOj) . Following the notation of Edwards [p22|, §10.3], let 

oo 

G(y) := 6{y 2 ) = ^ exp(-vrn 2 Z/ 2 ) (14) 

n=— oo 

so Jacobi's functional equation ([TOf) acquires the simpler form 

yG(y) = G(y- 1 ) (y > 0). (15) 



The function 



that is 



dy 



2yG(y) + y 2 G"(y) (16) 
= V ^(2vr 2 nV - 37rn 2 ) e -™ V (17) 



n=l 
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satisfies the same functional equation as G: 



yH{y) = Hiy- 1 ) (y > 0). (18) 

As indicated by Riemann, this allows (15) to be transformed by integration by parts for 



> 1 to yield 

r>oo 

.s-1 



2£(a) = / y s - L H(y)dy. (19) 
Jo 

It follows immediately by analytic continuation that (|T^p serves to define an entire 
function £(s) which satisfies Riemann's functional equation £(s) = £(1— s) for all complex 
s. Conversely, the functional equation flTB| ) for H is recovered from Riemann's functional 
equation for £ by uniqueness of Mellin transforms. The representation of £ as a Mellin 
transform was used by Hardy to prove that an infinity of zeros of ( lie on the critical 
line. It is also essential in the work of Polya |JT] and Newman ||9| on the Riemann 
hypothesis. But the probabilistic interpretations of flT8|) which we discuss in this paper 
do not appear in these works. 

2.2 Probabilistic interpretation of 2£(s) 

As observed by Chung |L4f and Newman H(y) > for all y > (obviously for 
y > 1, hence too for y < 1 by QTJ)). By (|), (§ and ~ (s - 1) _1 as s 1, 

2£(0) = 2£(1) = 1, 

so formula ( |T5D for 5 = and 5 = 1 implies 

POO POO 

/ y~ l H(y)dy = / H{y)dy = 1. 

That is to say, the function y^Hiy) is the density function of a probability distribution 
on (0, oo) with mean 1. Note that the functional equation (|18|) for H can be expressed 
as follows in terms of a random variable Y with this distribution: for every non-negative 
measurable function g 

E[g(l/Y)]=E[Yg(Y)]. (20) 

The distribution of 1/Y is therefore identical to the size-biased distribution derived from 
Y. See Smith-Diaconis |68| for further interpretations of this relation. The next lemma, 



which follows from the preceding discussion and formulas tabulated in Section gathers 
different characterizations of a random variable Y with this density. Here Y is assumed 
to be defined on some probability space (fl, J 7 , P), with expectation operator E. 
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Proposition 1 ( p4"| , §[) i*br a non-negative random variable Y , each of the following 
conditions (i) - (iv) is equivalent to Y having density y~ 1 H(y) for y > 0: 

(i) 

E(Y') = 2£(s) (s e C); 



(ii) fory>0 
that is 



P(Y <y) = G(y) + yG\y) = -y- 2 G'{y-' 



7m 2 /y 2 



n=l 



(iii) with £ 2 defined by (|6]) 



iv 



E 



.-AY 2 



sinh v7rA 



(21) 
(22) 

(23) 
(24) 
(25) 



3 Two infinitely divisible families 



This section presents an array of results regarding the probability laws on (0, oo) of the 
random variables and defined by ([]) with special emphasis on results for h — 1 
and h = 2, which are summarized by Table 1. Each column of the table presents features 
of the law of one of the four sums £ = Si, £ 2 , Sf of Those in the E 2 column can 
be read from Proposition IT], while the formulae in other columns provide to analogous 
results for E 1; Sf and E 2 instead of S 2 . While the Mellin transforms of Ei, S 2 and 
all involve the function £ associated with the Riemann zeta function, the Mellin 
transform of £f involves instead the Dirichlet L-function associated with the quadratic 
character modulo 4, that is 

00 ( —i) n 

L ^ s y-=Y,7triY (**>«)■ (26) 

n=0 ^ ' 

We now discuss the entries of Table 1 row by row. 
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Table 1 




n=l 


9 °° c _1_ c 
n=l 




V2A 




sinh v^A 


ysinhv^y 


Levy density 


1 

n=l 


2pi(x) 


/(*) := ^P(S < x) 


, oo 

" ^_^ri e -ra 2 7r 2 a:/2 

dec 

n=— oo 


, oo 

* y (l_n 2 A)e-» 2A / 2 
dx v 

n=— oo 


reciprocal relations 




\7TX/ \7l 2 X J 






^-E [(E 2 )V^(E 2 )] 




CHX 1 ) (§)'**> 


g)\(2.) 




_|_ (2 3n_2" +1 )(-l)" +1 J B 2n 


^ j (2n-l)2 3 "(-l)^5 2 „ 
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Table 1 continued 


oo 

1 ' 7T 2 ^ (n - |) 2 

71=1 v ^' 


oo . 
v # ^ \ ^ £71 + £n 

2 •" 7T 2 ^ (n - \f 


1 

cosh \/2X 


( 1 V 

\cosh\/2A/ 


oo 

p f( X ) := I £ e - 2 (n-l/2)V2 
X 

n=l 




oo 

7r^(-ir(n + i)e-(" + ^ 2 ^/ 2 

ra=0 


1 oo 

5 £ «» + i) 

n=— oo 


Vx - 1) e -("+i) 2 - 2 -/ 2 


/?(*) = ( 




ft(*) = - ( 






(sf)- 1/2 ^(sf)] 


0) 3/2 ^[(s 1 )- 1/2 ^(Si)] 


r( s + i)2 s+1 


/„ \ 2s+l 

("J ^ + 1) 


( 2 2(«+l) _ 1) 

s + 1 


0) S+1 e(2( S + l)) 


(2n)! 2 " ( - 1) "^ 


(2 2 " +2 -l)2 3 " +1 n! 

(n + l)(2n)! ( 1} ^ 
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3.1 Laplace transforms and Levy densities 



Recall from (j|) that := \ Yun=i %r where the distribution of the independent 
gamma(/i) variables r/j jn is characterized by the Laplace transform 



£[exp(-Ar M )] = (1 + A) 



Euler's formulae 



00 / z 2 \ 00 / z 2 \ 

smhz = zH (l + ^J and cosh z = J] f 1 + - 2 j 

n=l ^ ' n=l ^ ^ 2' ' 



allow the following evaluation [75, I3|: for 3?(A) > 



(27) 



(2f 



£ [e -AEh ] = F 



and similarly 



H exp 



71=1 



2Ai\„ 

n 2 ii 2 



n=l 



n i+ 



2A 
n 2 ir 2 



sinh V2A 



F[exp(-A£#)] 



(29) 



(30) 



,cosh V2A 

Levy densities p{x). A probability distribution F on the line is called infinitely divis- 
ible if for each n there exist independent random variables T n l , . . . , T n n with the same 
distribution such that XT=i -^M ^ as distribution F. According to the Levy-Khintchine 
representation, which has a well known interpretation in terms of Poisson processes |79|, 



§11.64], a distribution F concentrated on [0, 00) is infinitely divisible if and only if its 
Laplace transform 92(A) :— J °° e~ xt F(dt) admits the representation 



99(A) = exp I — cA 



(1 



v{dx] 



(A>0) 



(31) 



for some c > and some positive measure v on (0, 00), called the Levy measure of F, 
with c and v uniquely determined by F. 

If v(dx) = p(x)dx then p(x) is called the Levy density of F. It is elementary that 
for Th with gamma(/i) distribution and a > the distribution of T^/a is infinitely 
divisible with Levy density hx~ x e~ ax . It follows easily that for a n > with X]n a ™ < 00 
and independent gamma(/i) variables Th t n the distribution of ^„ri, in /a„ is infinitely 
divisible with Levy density hx^ 1 ^2 n e~ anX . Thus for each h > the laws of and 
S* are infinitely divisible, with the Levy densities indicated in the Table for h = 1 and 
h = 2. 
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We note that Riemann's formula ( |T2] ) for s = 2p can be interpreted as an expression 
for the pth moment of the Levy density p\ of S x : for 3lp > | 



:= / t p Pl (t)dt 



f 2 \ p 1 f°° 2? 

(-J 3/ iTWy)-i)di/ = ^r(p)c(2p). (32) 

3.2 Probability densities and reciprocal relations 

By application of the negative binomial expansion 

(T^f^g^^" ( ft >0, N <l) (33) 

there is the expansion 

( t \ h = 2H h e-* h = 2^^ T(n + h) (2n+h)t 

\smht) ~ (l-e- 2t ) h T(h)^ Q T(n + l) 1 1 

which corrects two typographical errors in || (3.v)], and 

(_}_\ h = ^- th = fv ir nn+h) h)t 

\cosht) ~ (l + e- 2t ) h J T{n + 1) ' 1 ' 

The Laplace transform fl3"U| ) can be inverted by applying the expansion ( |3"o] ) and inverting 
term by term using Levy's formula [45] 

:e -ay(2t) e -Xt dt = e -aV2X_ (36) 



00 /7 



Thus there is the following expression for the density f*(t) := P(S# G dt)/dt: for 
arbitrary real h > 0: 

f%) 2 " Vf ir r(n + fe)(2n + /t) cxp f (2 " + / *H (37) 

h {t) - T(h) ^} 1} r(n + 1) P V » J ' (37) 



A more complicated formula for fhif) was obtained from fl34|) by the same method in 
|| (3.x)]. The formulae for the densities of and S* displayed in Table 1 for h — 1 
and h = 2 can be obtained using the reciprocal relations of Row 5. The self-reciprocal 
relation involving S 2 is a variant of fllTf ), while that involving Sf , which was observed 
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by Ciesielski- Taylor is an instance of another application of the Poisson summation 
formula which is recalled as (p4j ) in Section |6.1j . Lastly, the reciprocal relation involving 

_ll _ll 

the densities fx of Si and f% of E^ amounts to the identity 

P(Ei < x) = ^ (-l)"e-" 2ff21 '/ 2 = ^ e -2(n+2) 2 /- (38) 

7i=— 00 n=— 00 

where the second equality is read from (f|) with x = 1/2 and t replaced by x/2. 
Formulae for E [g (-^)]« These formulae, valid for an arbitrary non-negative Borel 
function function g, are integrated forms of the reciprocal relations, similar to (|20|). 



3.3 Moments and Mellin transforms 

It is easily shown that the distributions of and E* have moments of all orders (see 
e.g. Lemma f|). The formulae for the Mellin transforms i?(E s ) can all be obtained by 
term-by-term integration of the densities for suitable s, followed by analytic continuation. 
According to the self- reciprocal relation for E^ , for all s e C 



E 



2^1 



E 



# 



2^1 



(39) 



Using the formula for E((T,f) s ) in terms of L XA defined by fl26|), given in Table 1, we see 
that if we define 



A X4 (t) :=E 



IT y# 

2 ^1 



t+1 



t+1 

4\ — 

7T 



(40) 



then fl39|) amounts to the functional equation 



A X4 (1 







(t e C). 



(41) 



This is an instance of the general functional equation for a Dirichlet L— function, which 
is recalled as ([55|) in Section ||. 

Positive integer moments E7(E"). These formulas are particularizations of the pre- 
ceding Row, using the classical evaluation of (C2n) in terms of the Bernoulli numbers 
B 2n . The result for E{ involves the Euler numbers E 2n defined by the expansion 



cosh(,2) 



e z + e 



n=0 



„2f( 



(2n) 



(42) 
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A multiplicative relation. Table 1 reveals the following remarkably simple relation: 



>l-2s 



1 



mi) = [ x _ 2s ) mi) (43) 

where the first factor on the right side is evaluated by continuity for s = 1/2. By 
elementary integration, this factor can be interpreted as follows: 

)l-2s i ■ 

1 E(W~ 2s ) (44) 



1 - 2s 



for a random variable W with uniform distribution on [1,2]. Thus (f43|) amounts to the 
the following identity in distribution: 



Ex = W- 2 Z 2 (45) 



where W is assumed independent of £2. An equivalent of (|45|) was interpreted in terms 
of Brownian motion in (4)]. 



Note that (f45|) could be rewritten as Si = W~ 2 (Y,i + £1) for £1, £1 and W indepen- 
dent random variables, with £1 having the same distribution as £1, and W uniform on 
[1, 2]. By consideration of positive integer moments, this property uniquely characterizes 
the distribution of £1 among all distributions with mean 1/3 and finite moments of all 
orders. 

3.4 Characterizations of the distributions of E2 and Sf 



As just indicated, the identity (45) allows a simple probabilistic characterization of the 
distribution of £1. The following Proposition offers similar characterizations of the dis- 
tributions of £2 and £J • 

Proposition 2 Let X be a non-negative random variable, and let X* denote a random 
variable such that 

P{X* E dx) = xP{X E dx)/E{X). 
(i) X is distributed as £ 2 if and only if E(X) = 2/3 and 

X* = X + HX* (46) 
where X, H and X* are independent, with 

P{H e dh) = {h- 1/2 - l)dh (0 < h < 1) 
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(ii) X is distributed as E* if and only if E(X) = 2 and 

X* = X + U 2 X (47) 
where X , U and X are independent, with X distributed as X and U uniform on [0, 1]. 

For the proof of this Proposition, note that ( f46|) (or (0)) imply that the Laplace trans- 
form of X satisfies an integro-differential equation, whose only solution is given by the 
appropriate function. The "only if" part of (i) appears in p2| , p. 26]. Details of the 



remaining parts will be provided elsewhere. As remarked by van Harn and Steutel [74] , 
it is an easy consequence of the Levy-Khintchine formula that for non-negative random 
variables X and Y the equation X* = X + Y is satisfied for some Y independent of X 
if and only if the law of X is infinitely divisible. As discussed in |74] , [J, the distribu- 
tion of X*, known as the size-biased or length-biased distribution of X, has a natural 
interpretation in renewal theory. 



4 Brownian interpretations 

It is a remarkable fact that the four distributions considered in Section |3| appear in many 
different problems concerning Brownian motion and related stochastic processes. These 
appearances are partially explained by the relation of these distributions to Jacobi's 
theta function, which provides a solution to the heat equation [j5], 23[], and is therefore 



related to Brownian motion E2, §5.4]. We start by introducing some basic notation 
for Brownian motion and Bessel processes, then present the main results in the form of 
another table. 



4.1 Introduction and Notation 

Let (3 := (j3 t , t > 0) be a standard one- dimensional Brownian motion, that is a stochastic 
process with continuous sample paths and independent increments such that (3q = 0, 
and for all s, t > the random variable (3 s+ t — (3 S has a Gaussian distribution with mean 
E(j3 s+t — p s ) = and mean square E[(p s+t — Ps) 2 } = t, meaning that for all real x 

P(P s+ t - Ps < x) = -L T e-y 2 /^ dy. 

V Znt J-oo 

Among continuous time stochastic processes, such as semimartingales, processes with 
independent increments, and Markov processes, Brownian motion is the paradigm of a 
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stochastic process with continuous paths. In particular, among processes with stationary 
independent increments, the Brownian motions (<jB t + fit, t > 0) for a > 0, \x G K are 
the only ones with almost surely continuous paths |64], 1.28.12]. Brownian motion arises 
naturally as the limit in distribution as n — ■> oo of a rescaled random walk process 
(S n , n = 0, 1, . . .) where S n — X\ + ■ ■ ■ + X n for independent random variables Xj with 
some common distribution with mean E(X,j) = and variance E(Xf) = 1. To be more 
precise, let the value of S r be extended to all real r > by linear interpolation between 
integers. With this definition of (S r , r > 0) as a random continuous function, it is known 
that no matter what the distribution of the Xi with mean and variance 1, as n — > oo 



*" t t>0) ±{[%,t>0) (48) 



n 

in the sense of weak convergence of probability distributions on the path space C[0, oo). 
In particular, convergence of finite dimensional distributions in (|4"8"D follows easily from 
the central limit theorem, which is the statement of convergence of one dimensional 
distributions in that is for each fixed t > 

' S '" / " A (49) 

"n 



where — > denotes weak convergence of probability distributions on the line. Recall that, 
for random variables W n , n = 1, 2, . . . and W such that W has a continuous distribution 
function x \— > P(W < x), W n — > means P(W n < x) — > P(Vr < x) for all real x. See 



0, |62| for background. 

Let (b t , < t < 1) be a standard Brownian bridge, that is the centered Gaussian 
process with the conditional distribution of (f3 t ,0 < t < 1) given /3i = 0. Some well 
known alternative descriptions of the distribution of b are |]62| , Ch. Ill, Ex (3.10)] 

(6t,0<t <1) = (A-*#l,0<*<1) = ((l-t)A /( i_i),0<t < 1) (50) 

where = denotes equality of distributions on the path space C[0, 1], and the rightmost 
process is defined to be for t = 1. According to a fundamental result in the theory of 
non-parametric statistics [19|, |66|, the Brownian bridge arises in another way from the 
asymptotic behaviour of the empirical distribution 



n 



F n {x) := - > l(X k <x) 
n 

k=l 
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where the are now supposed independent with common distribution P{Xi < x) — 
F(x) for an arbitrary continuous distribution function F. As shown by Kolmogorov f43f , 
the distribution of sup,,. \F n (x) — F(x)\ is the same no matter what the choice of F, and 
for all real y 



lim P(v^sup \F n (x) - F(x)\ < y) = V (-l) n e 



n„-2n 2 y 2 



(51) 



For F the uniform distribution on [0, 1], so F(t) = t for < t < 1, it is known that 



(Vn(F n (t)-*),0<*<l) 



< t < 1). 



(52) 



As a well known consequence of (|52|) , Kolmogorov's limiting distribution in (^T|) is iden- 
tical to the distribution of max <i<i \b t \. On the other hand, as observed by Watson |76| 



Kolmogorov's limit distribution function in ( |5lD is identical to that of |ySi. Thus we 
find the first appearance of the law of Si as the law of a functional of Brownian bridge. 

To put this in terms of random walks, if (S n ) is a simple random walk, meaning 
P{Xi = +1) = P(Xi = -1) = 1/2, then 



S- 



2nt 



2n 



< t < 1 



S- 



2)i 







{b t ,0<t< 1) 



(53) 



where on the left side the random walk is conditioned to return to zero at time 2n, and 
on the right side the Brownian motion is conditioned to return to zero at time 1. Thus 



1 



max I Sk I 

2n 0<k<2n 



2n 







max \b t \ 

0<t<l 



d 7T 



(54) 



where the equality in distribution summarizes the conclusion of the previous paragraph. 
In the same vein, Gnedenko |26[ derived another asymptotic distribution from random 
walks, which can be interpreted in terms of Brownian bridge as 



max Sk — min Sk 

0<k<2n 0<k<2n 



S2n = ] max b t - min b t = (55) 

0<t<l 0<i<l 2 



The equalities in distribution in both (|54D and (^) can be deduced from the formula 



P( min b u > —a, max b u < b) = > e 

0<U<1 0<M<1 ' 



2k 2 (a+b) 2 



k=—oo 



E 1 

k=— oo 



-2[b+k(a+b)] 2 



(56) 
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of Smirnov |J7J and Doob JHJ. Kennedy [[JIJ found that these distributions appear again 
if the random walk is conditioned instead on the event (R = 2n) or (R > 2n), where 

R := mf{n > 1 : S n = 0} 

is the time of the first return to zero by the random walk. Thus 

d 7T 



1 



max I Sk 

2n 0<k<2n 



R = 2n 



max e t 

0<t<l 



(57) 



where (e t , < t < 1) denotes a standard Brownian excursion, that is the process with 
continuous sample paths defined following |33|, 35], |21| by the limit in distribution on 
C[0, 1] 



\s> 



2nt\ 



2n 



< t < 1 



R = 2n) ^ (e t ,0 < t < 1). 



(58) 



A satisfying explanation of the identity in distribution between the limit variables fea- 
tured in fl55| ) and ( |5T| ) is provided by the following identity of distributions on C[0, 1] 
due to Vervaat fl75| : 

(e u , < u < 1) = (6 p+M{mo di) - 6 P , < u < 1) (59) 

where p is the almost surely unique time that the Brownian bridge b attains its minimum 
value. As shown by Takacs[[fl]] and Smith-Diaconis pB |, either of the approximations 
(^5|) or (|57|) can be used to establish the differentiated form fl22f ) of Jacobi's functional 



equation ([15]) by a discrete approximation argument involving quantities of probabilistic 
interest. See also Polya |6(| for a closely related proof of Jacobi's functional equation 
based on the local normal approximation to the binomial distribution. 
In the same vein as (|55|) and 



1 



max I Sk 

2n 0<k<2n 



there is the result of H35i |2ll that 
R > 2n | — max mt 

0<i<l 



7Ta/Si 



(60) 



where (m t , < t < 1) denotes a standard Brownian meander, defined by the limit in 
distribution on C[0, 1] 



\S- 



2nt 



2n 



< t < 1 



R > 2n) ^ (m t ,0 <t<l). 



(61) 



The surprising consequence of (|54] ) and (|60|), that max < ( <i m ( = 2max < t <i |6 t |, was 
explained in || by a transformation of bridge b into a process distributed like the meander 
m. For a review of various transformations relating Brownian bridge, excursion and the 
meander see [|]. 



17 



4.2 Bessel processes 



The work of Williams |J77| , [7q , [79| shows how the study of excursions of one- dimensional 
Brownian motion leads inevitably to descriptions of these excursions involving higher 
dimensional Bessel processes. For d — 1, 2, ... let Rd := (Rd,t, t > 0) be the d- dimensional 
Bessel process BES(d) , that is the non-negative process defined by the radial part of a 
c?-dimensional Brownian motion: 



2 

i,t 

i=l 



where (-Bj,t, t > 0) for % = 1, 2, ... is a sequence of independent one-dimensional Brownian 
motions. Note that each of the processes X = B, and X = Rd for any d > 1, has the 
Brownian scaling property: 

(X u ,u>0) = (V^X u/c ,u>0) (62) 

for every c > 0, where = denotes equality in distribution of processes. For a process 
X = (X t ,t > 0) let X and X_ denote the past maximum and past minimum processes 
derived from X, that is 



X t := sup X s ; X_ t := inf X s . 

0<s<t ' 0<s<t 

Note that if X has the Brownian scaling property (^) then so too do X, X, and X — X. 
For a suitable process X, let 

(L*(X),t > 0,x G K) 
be the process of local times of X defined by the occupation density formula 

pt poo 

/ f(X s )ds= / f(x)L*(X)dx (63) 

for all non-negative Borel functions /, and almost sure joint continuity in t and x. See 
52 , Ch. VI] for background, and proof of the existence of such a local time process for 
X = B and X = Rd for any c? > 1. 

Let := (rd jU ,0 < u < 1) denote the d-dimensional Bessel bridge defined by con- 
ditioning < u < 1 on Rd t \ = 0. Put another way, r\ is the sum of squares of d 
independent copies of the standard Brownian bridge. 
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4.3 A table of identities in distribution 



We now discuss the meaning of Table 2, which presents a number of known identities 
in distribution. The results are collected from the work of numerous authors, including 
Gikhman |25|, Kiefer Chung 0, Biane-Yor [g. See also gg, ^ §BJ. In the 



following sections we review briefly the main arguments underlying the results presented 
in the table. 

Each column of the table displays a list of random variables with the distribution 
determined by the Laplace transform in Row 0. Each variable in the second column is 
distributed as the sum of two independent copies of any variable in the first column, and 
each variable in the fourth column is distributed as the sum of two independent copies of 
any variable in the third column. The table is organized by rows of variables which are 
analogous in some informal sense. The next few paragraphs introduce row by row the 
notation used in the table, with pointers to explanations and attributions in following 
subsections. Blank entries in the table mean we do not know any construction of a 
variable with the appropriate distribution which respects the informal sense of analogy 
within rows, with the following exceptions. Entries for Rows 4 and 6 of the S 2 column 
could be filled like in Row 3 as the sums of two independent copies of variables in the 
Si column of the same row, but this would add nothing to the content of the table. The 
list of variables involved is by no means exhaustive: for instance, according to fl5q) the 
variable (4/7r 2 )(6 1 — 6 1 ) 2 could be added to the second column. Many more constructions 
are possible involving Brownian bridge and excursion, some of which we mention in 
following sections. It is a consequence of its construction, that each column of the table 
exhibits a family of random variables with the same distribution. Therefore it is a natural 
problem, coming from the philosophy of "bijective proofs" in enumerative combinatorics 
(see e.g. Stanley [7D|), to try giving a direct argument for each distributional identity, 
not using the explicit computation of the distribution. Many such arguments can be 
given, relying on distributional symmetries of Brownian paths, or some deeper results 
such as the Ray-Knight theorems. However, some identities remain for which we do not 
have any such argument at hand. As explained in Section |4.6| , some of these identities 
are equivalent to the functional equation for the Jacobi theta (or the Riemann zeta) 
function. 

Rows and 1. Row displays the Laplace transforms in A of the distributions of the 
variables in Row 1, that is Si, S 2 , Sf and S 2 , as considered in previous sections. 
Row 2. Section |4]4] explains why the distributions of the random variables f$ r du du 
and J Q R 2 du du for d = 2 and d = 4, indicated in this row. 
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Row 3. Most of the results of this row are discussed in Section fO. Here 



T a {X) := inf{* : X t = a} 

is the hitting time of a by a process X, and is an independent copy of the Bessel 
process Rd- Note that R± := \B\ is just Brownian motion with reflection at 0, and 
Ti(B — B_) is the first time that the range of the Brownian B up to time t is an interval 



of length 1. The result that ATi(B — B_) has Laplace transform 1/ cosh v2A is due 



to Imhof gO 



See also Vallois |72|, f73| , Pitman |5T| and Pitman- Yor [^6j for various 
refinements of this formula. 

Rows 4 and 5 These rows, which involve the distribution of the maximum of various 
processes over [0, 1], are discussed in Section [OJ 

Row 6 Here Wii is the maximum of the standard Brownian meander (m u ,0 < u < 1). 
This entry is read from (^D[). 

Row 7. The first two entries are obtained from their relation to the first two entries in 
Row 5, that is the equalities in distribution 

1 du d n _ , f 1 du d „_ 
= lr\ i and / = 2r 3 i. 

o m u Jo r 3iU 

These identities follow from descriptions of the local time processes (Lf(rd),x > 0) for 
d = 1 and d = 3, which involve m for d = 1 and r 3 for d = 3, as presented in Biane-Yor 



H Th. 5.3]. See also |52|, Cor. 16] for another derivation of these results. The last two 
entries may be obtained through their relation to the last two entries of Row 2. More 
generally, there is the identity 



ds 



R, 



d,s 



^ci- 



ds 



-1/2 



{d> 1) 



which can be found in Biane-Yor || and Revuz-Yor g2[ Ch. XI, Corollary 1.12 and p. 
448]. 

Row 8. Here n := inf{t : L° t {B) = 1} where R 1 = \B\ and (Lf(B),t >0,i6l) is the 



local time process of B defined by (^). The distribution of Ti/R 1 „ was identified with 



■n 



B_ T1 ) 2 was identified 



that of 4T 1 ( J R 3 ) by Knight while the distribution of n/(B n 
with that of Ti(R 3 ) +T!(i?3) by Pitman- Yor |p6[ . The result in the third column can be 
read from Hu-Shi-Yor f§|, p. 188]. 
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Table 2 



0) 



2A 



sinh V2A 



sinh v 2 A 



1 



cosh V2A 



cosh V2A 



1) 



2 Ae, 



7r^ 14 — ' rr 

n=l 



v 2 \ ^ £ n + £ n 
^ 2 - _ ~ 2 2 



n=l 



n=l 



(n-|)= 



2) 



./o 



Rludu 



R\ u du 



3) 



Ti(i? 3 ) 



Ti(i?i) 



AT^B-B) 



4) 



— x -2 



— \ -2 



4(51-^) 



5) 



6) 



:^i) s 



7) 



i r 1 d«v 

Wo «W 



1 /'' r/,/ N 2 
Wo r 3 „ 



1 Z" 1 ciw 



2 Jo -R 



2 ,u 



1 f 1 du 



2 Jo ^3,i* 



8) 



4(R 



1,71, 



<1 JO 



B 2 t dt 



21 



4.4 Squared Bessel processes (Row 2) 

For d — 1,2,... the squared Bessel process R 2 d is by definition the sum of d independent 
copies of R\ = B 2 , the square of a one-dimensional Brownian motion B, and a similar 
remark applies to the squared Bessel bridge r\. Following Levy [|^, fJ7|, let us expand 
the Brownian motion (B t , < t < 1) or the Brownian bridge (6 t , < t < 1) in a Fourier 
series. For example, the standard Brownian bridge b can be represented as 

b u = sin(7mu) (0 < u < 1) 

n=l 

where the Z n for n = 1,2,... are independent standard normal random variables, so 
E(Z n ) = and E(Z 2 ) = 1 for all n. Parseval's theorem then gives 



„! oo 

/ 6£d« = E 



7 2 

7T 2 n 2 



n=0 

so the random variable Jg 1 &^c?m appears as a quadratic form in the normal variables Z n . 

It is elementary and well known that Z\ — 271/2 for 71/2 with gamma(|) distribution as 
in <g) and (|7|) for h = \. Thus 

E[exp(-A^)] = (l + 2A)- 1 / 2 



and 

E 



exp I —A / b\du 



n 1+ 



s 1/9 / V 1/2 

2A \ _1/2 / v/2A 



7r 2 n 2 y \sinhV2A 



n=l 

by another application of Euler's formula (p5[) . Taking two and four independent copies 
respectively gives the first two entries of Row 2. The other entries of this row are obtained 
by similar considerations for unconditioned Bessel processes. 



Watson 761 found that 



1 

d 







bt - / b u du dt = |E X . (64) 



Shi-Yor give a proof of ( pip with the help of a space-time transformation of the 
Brownian bridge. See also p. 18-19], p. 126-127] and papers cited there for 
more general results in this vein. In particular, we mention a variant of (^4j) for B 
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instead of 6, which can be obtained as a consequence of a stochastic Fubini theorem [ST 
p. 21-22]: 

J ( B t~ J q Budu^J dt = J b 2 u du = E 1/2 . (65) 
As remarked by Watson, it is a very surprising consequence of (|64] ) and fl54|) that 



h — / b u du | dt = n 2 max 6?. (66) 
Jo / °St<i 

As pointed out by Chung ||14|| , the identities in distribution (|55| ) and ([57]), where £2 is 
the sum of two independent copies of Si, imply that the distribution of (maxo<f<i&t — 
min <t<i b t ) 2 is that of the sum of two independent copies of max <t<i bf. In a similar 
vein, the first column of Table 2 shows that the distribution of Att~ 2 max <[<i b\ is in turn 

that of the sum of two independent copies of J* b 2 dt — Ex/2- There is still no explanation 
of these coincidences in terms of any kind of transformation or decomposition of Brownian 
paths, or any combinatorial argument involving lattice paths, though such methods have 
proved effective in explaining and generalizing numerous other coincidences involving the 
distributions of E^ and E* for various h > 0. Vervaat's explanation (|59|) of the identity 
in law between the range of the bridge and the maximum of the excursion provides one 
example of this. Similarly, (|59|) and (0) imply that 

e*— / e u du) dt = \E X . (67) 







4.5 First passage times (Row 3) 

It is known [16, 31, 36] that by solving an appropriate Sturm-Liouville equation, for 
A > 

EeM-^UR,)) = { ' Mr = = f] (l + 



2"T{v + 1)/,(V2A) 

where v := {d — 2)/2 with I v the usual modified Bessel function, related to J v by 
{ix) u I I u {ix) = x v I J v (x), and j v>1 < jV,2 < • • • is the increasing sequence of positive zeros 
of J v . That is to say, 



00 



n=l 

where the e n are independent standard exponential variables. See also Kent [0, [38| , and 
literature cited there, for more about this spectral decomposition of Ti(X), which can 



23 



be formulated for a much more general one- dimensional diffusion X instead of X = R^. 
The results of Row 3, that 



T X {R X ) = Sf and T X {R 3 ) = E x 



are the particular cases d — 1 and d = 3 of (|68D , corresponding to z/ = ±1/2, when I u and 
can be expressed in terms of hyperbolic and trigonometric functions. In particular, 
(n — i)7r and ji/2, n = are the nth positive zeros of the cosine and sine 



J-l/2,n = [n 

functions respectively. Comparison of Rows 2 and 3 reveals the identities 

Ti(#i) = [ R\ u du and Ti(R 3 ) = [ r\ u du. 
Jo Jo 

As pointed out by Williams [77|, |7^, 79 1, these remarkable coincidences in distribution 
are the simplest case g(u) = 1 of the identities in law 



Ti(Ri) 



g(l-R 1>t )dt 



and 



g(R^t)dt 



R lu9(u)du 



rl u g(u)du 



(69) 



(70) 



where the two Laplace transforms involved are again determined by the solutions of a 
Sturm-Liouville equation H, H, Ch. XI]. Let Lf(R d ),t > 0,x E R) and Lf(r d ),0 < 
t < 1, x G M) be the local time processes of Rd and r d defined by the occupation density 
formula (|63| ) with B replaced by Rd or r^. Granted existence of local time processes for 
Rd and rd, the identities (|69|) and ([f(]) are an expression of the Ray-Knight theorems 



62], Ch. XI, §2] that 



and 



(Ll-^iU < u < 1) = (i? 2 2 u , 0<u<l) 



(L u Ti{R3) (R 3 ) } 0< u < 1) i (r 2 2 u ,0 < u < 1). 



(71) 



(72) 



The next section gives an interpretation of the variable T X (R 3 ) + Ti(R 3 ) appearing in 
column 2 in terms of Brownian excursions. 
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4.6 Maxima and the agreement formula (Rows 4 and 5) 



The entries in Row 4 are equivalent to corresponding entries in Row 3 by application to 
X = R d and X = B — B_ of the elementary identity 



{X: 



\-2 A 



Ti(X) 



(73) 



which is valid for any process X with continuous paths which satisfies the Brownian 
scaling identity fl62|), because 



P((X!)" 2 >t) = P(X 1 < t~2) = P(X t < 1) = P{T X {X) > t). 

The first entry of Row 5, with fn := maxo< u <i \b u \, is read from fl5ip. The second entry 
of Row 5, involving the maximum r 31 of a three-dimensional Bessel bridge (r 3 u , < 
u < 1), is read from the work of Gikhman |25] and Kiefer [5S], who found a formula 
for P(fdi < x) for arbitrary d — 1,2,.... See also [p8 |. This result involving r 31 may 
be regarded as a consequence of the previous identification (^7|) of the law of e\, the 

maximum of a standard Brownian excursion, and the identity in law ex = r 31 implied 



by the remarkable result of Levy- Williams [[47], O > ^hat 



(e t ,0<t < 1) = (r 3 , t ,0<t<l). 



(74) 



Another consequence of the scaling properties of Bessel processes is provided by the 
following absolute continuity relation between the law of (jd,i)~ 2 arid the law of 



J 2,d 



T\{Rd) + Ti(R d ) 



for general d > 0. This result, obtained in || 0, 0, 
for every non-negative Borel function g 



} , we call the agreement formula: 



E [g((r d ^- 2 )} = C d E [E^^(E 2j£ 



(75) 



where C d '■— 2^ 2 ^ 2 T(d/2). In J|5| the agreement formula was presented as the special- 
ization to Bessel processes of a general result for one-dimensional diffusions. As explained 
in H 80[ [55], the agreement formula follows from the fact that a certain <r-finite mea- 
sure on the space of continuous non-negative paths with finite lifetimes can be explicitly 
disintegrated in two different ways, according to the lifetime, or according to the value 
of the maximum. 

Note from ( |68D that S 2>3 = £2 and S 2i i = £*. For d = 3 formula ( |75D gives for all 
non-negative Borel functions g 



E[g{f 3>1 )} 



IE 



S 2 0(1/ 



(76) 



25 



In view of ([76]), the symmetry property ( [20] ) of the common distribution of Y and */|E 2 , 
which expresses the functional equations for £ and 6 1 , can be recast as the following 
identity of Chung |TJ[], which appears in the second column of Table 2: 

(^ 3ll ) 2 ^S 2 . (77) 

As another application of (^), we note that for d = 1 this formula shows that the 
reciprocal relation between the laws of Ei and E* discussed in Section |] is equivalent to 
the equality in distribution of fl54|) , that is 

(^i,i) 2 =£i- (78) 

We do not know of any path transformation leading to a non-computational proof of 
©or©. 



4.7 Further entries. 



The distributions of Ti(R d ) and Ti(R d ) + Ti(R d ) for d — 1, 3 shared by the columns of 
Table 2, also arise naturally from a number of other constructions involving Brownian 
motion and Bessel processes. Alili found the remarkable result that 



n 2 



coth(/ir 3jU )(iM I — 1 



E 2 for all \i ^ 0. 



(79) 



As a check, the almost sure limit of the left side of ([79]) as /i — » is the variable 
^^(Jo r 3,u^ u ) 2 m ^ ne second column of Row 7. As shown by Alili-Donati-Yor 0, con- 
sideration of (|79D as — > oo shows that 



dt 



Ei 



exp(i? 3jt ) - 1 

Other results are the identity of Ciesielski- Taylor |16| according to which 

poo 

/ l{R d+ 2,t<l)dt = T x {R d ) 
Jo 



(80) 



51) 



which for d = 1 and = 3 provides further entries for the table. See also [41], |51 



81] , p. 97-98, Ch. 7], |82[ p. 132-133], JST] for still more functional of Brownian motion 
whose Laplace transforms can be expressed in terms of hyperbolic functions. 
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5 Renormalization of the series J^n s . 
5.1 Statement of the result 

The expansion of Si as an infinite series (^) suggests that we use partial sums in order 
to approximate its Mellin transform. As we shall see, this yields an interesting approxi- 
mation of the Riemann zeta function. Consider again the relationship (0) between ( and 
£, which allows the definition of ((s) for s ^ 1 despite the lack of convergence of ([I]) for 
9fts < 1 . There are a number of known ways to remedy this lack of convergence, some of 



which are discussed in Section |5.5| . One possibility is to look for an array of coefficients 
(a n N, 1 < n < N) such that the functions 



Kn(s) 



N 

E 



0>n,N 

n s 



converge as iV — > oo, for all values of s. For fixed N there are N degrees of freedom in the 
choice of the coefficients, so we can enforce the conditions kn{s) = ({s) at iV choices of s, 
and it is natural to choose the points 0, where £(0) = — |, and —2, —4, —6, . . . , — 2(iV— 1) 
where £ vanishes. It is easily checked that this makes 



0"n,N 



{-N){1- N){2- N) ...{n 



N) 



(N + 1)(N + 2)...(N + n) 



2N 
N -n 



2N 
N 



Note that for each fixed n 
and recall that 



a>n,N ( — 1)™ as iV — ► oo 



(2 1 - s -l)C(s) = ^L^for Ks>l 



71=1 



extends to an entire function of s. One has 



Theorem 3 For £j, 1 < i < N independent standard exponential variables, and 9fts > 
-2N 



E 




2 s/2_ 



7T 



N 

E 

n=l 



n s 
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where the a njN are defined by (|8"3|) ; and 



(2 1 "' - l)C(a) asN^oo (85) 

n=l 

uniformly on every compact subset of C. 

The proof of Theorem [3] occupies the next two sections. 

5.2 On sums of independent exponential random variables 

Let (e n ; n > 1) be a sequence of independent identically distributed random variables, 
with the standard exponential distribution P(e n > x) — e~ x ,x > 0. Let (a n ; n > 1) 
be a sequence of positive real numbers, such that Y^Li a n < oo, then the series X = 
Y^=i a n£ n converges almost surely, and in every LP space, for 1 < p < oo. 

Lemma 4 Let X = J^^Li a n£n as above, and let X^ = ^2n=i a n £ n be the partial sums, 
then for every real x one has E[X X ] < oo, and E[X%\ < oo for x > —N. Furthermore 
one has 

E[X S N ] -> E[X S ] as N -> oo 
uniformly with respect to s on each compact subset of C. 

Proof. We have already seen that i^Jf 1 ] < oo if x > 0. Let us prove that .ELY^] < oo 



for > x > —N. Let &at = min(a n ;n < N), then Xn > 5jvY/v = ^NY^n=i £ n- But Yj 



N 



has a gamma distribution, with density jrmt N - 1 e- t at t > 0, so that £[Y$ < oo for 
> x > —N and thus < oo. The assertion for X follows from X > X^. It 

remains to check the uniform convergence. If 3fts G [—A, +A], then 



\X S N -X S 



x 

8-1, 



sy s 'dy 

< IsKX-XnXX^VX*- 1 ) 



and the required uniform convergence as iV — > oo is now evident by application of the 
Cauchy-Schwarz inequality. □ 

We now compute the Mellin transform of the distribution of Yln=i a n£n, assuming 
that the a n are all distinct and strictly positive. 
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Lemma 5 With the above notations, and H n ,N '■= Ylj^n i<j<N fl — 7r) > f or ^ s > 



E 



N 

E 



N 



n=l 



16) 



where the right side of $3b\ ) is defined by continuity for s = — 1, —2, . . . , —N + 1. 
Proof. The partial fraction expansion of the Laplace transform 



E 



N 



exp 



A a n£n 



n=l 



N N 

n l - =y— - 

Al(l + Aon) ^ Ii n ,N (1 + Aa r 

n=l v ' n=l ' v 



(87) 



E 



implies that for every non-negative measurable function g such that E\g(a n £\)\ is finite 
for every n 

r / v \ "i 

^=^-^(0^1)]. (88) 

For (yf(x) = x s this gives (0), first for real s > — 1, then also for -fts > — N since the 
previous lemma shows that the left side is analytic in this domain, and the right side is 
evidently meromorphic in this domain. □ 



N 
>n=l 



Note the implication of the above argument that the sum on the right side of (|36l 
must vanish at s = —1, —2, . . . , — N + 1. 



5.3 Proof of Theorem 3 

This is obtained by applying the preceding results in the particular case a n = n~ 2 . By 
application of Lemmas ^ and |5|, and the formula for E(T,{) in Table 1, found in |pH| , (86)], 
the conclusions of Theorem |3| hold for 



- 2a 



fU(j 2 ) 



n,N 



n 



n,N 



7T 



(89) 



where both products are over j with 1 < j < N and j ^ n. The product in the numerator 
is (N\/n) 2 while writing j 2 — n 2 = (j — n)(j + n) allows the product in the denominator 
to be simplified to (— l) n_1 (./V + n)\(N — n)\/{2n 2 ). Thus the expression for a ni jv can be 
simplified to (jB9|). □ 
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5.4 The case of the L X4 function 



The following result can be obtained similarly, with the help of the formula for E((Ef) s ) 
in terms of L X4 defined by ([26|) , given in Table 1 of Section ||. 



Theorem 6 Let 



N-l 



2N-1 

4»>( S ) :=£(-!)» A^Zl^( 2n+i: 



n=0 



N-l 



2N - 1 



Then v ~^^ n ^ — — > 1 as N — > oo; /or eac/i i\T, one /ias L^(l — 2&) = /or = 
\ N ) 

1,2, ... N — 1, and L^\s) — > £ X4 (s) uniformly on every compact of C. 

We note that it is also possible to use formulae © and (H) to provide another approxi- 
mation of £. We leave the computation to the interested reader. 

5.5 Comparison with other summation methods 

Perhaps the simplest way to renormalize the series ([I]) is given by the classical formula 

( - N l ~ s - 1 \ 1 
COO = lim HT n- s - — - > 0) (90) 

AT-»oo \ ' 1 — S / 1 — S 

\n=l / 

Related methods are provided by the approximate functional equation and Riemann- 
Siegel formula, which are powerful tools in deriving results on the behaviour of the zeta 



function in the critical strip. See e.g. Ch. 7 of Edwards |22| for a detailed discussion. 
It is also known |44], |28| that the series Y1T ^— 7 is Abel summable for all values of 



s G C, meaning that as z — > 1 in the unit disk, 
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1 

The Lerch zeta function 



g2i7rnx 



$(x,a,s) = V- (x 6 M, < a < 1, &(s) > 1) 



n=0 



30 



is known to have analytic continuation to s G C, with a pole at s = 1 for a = 1, a: 6 Z. 
This allows us to sketch another proof of Theorem [| The formula 

Kn ( s ) = 2 2N (™\ [\sm{7tx)) 2N <l>{x,l,s)dx 



is easily checked using (jg2"|)-([g3"D for 3?(s) > 1, and extended by analytic continuation 
to all values of s G C. Convergence of kn{s) towards (2 1_s — l)C(s) then follows from 
continuity properties of the Lerch zeta function in the variable x, and the fact that 



" 2N ( ^ ) ( sin ( 7rx )) 2JV dx 5 V2 weakly as iV oo. 
finally, we note that J. Sondow [B^] has shown that ] 



Finally, we note that J. Sondow |5J|] has shown that Euler's summation method yields 
the following series, uniformly convergent on every compact of C: 



,1-s 



°° ) 2 "* + --- + (- 1 ) , ( + 

KM = E — w 2J+ , w • 

3=0 



Furthermore the sum of the first N terms of this series gives the exact values of ( at 
0, —1, —2, . . . , —N + 1, so we can rewrite the partial sum in ( |9~1~1) as 

N , 

i 

where the 6 n> jv are completely determined by Pn(~j) — C(~j) for j = 0, 1, . . . , AT — 1. 
Compare with the discussion between ( p2[ ) and fl83|) to see the close parallel between ( p5| ) 
and (BB). 



6 Final remarks 

6.1 Hurwitz's zeta function and Dirichlet's L-functions 

Row 3 of Table 2 involves hitting times of Bessel processes of dimension 1 and 3, started 
from 0. If the Bessel process does not start from zero, we still have an interesting formula 
for the Mellin transform of the hitting time, expressed now in terms of the Hurwitz zeta 
function. Specifically, one has 



#[ e -ATf(ii3)i = sinh(ay / 2A) E[ e -xi?(Rift = cosh ( a v 2 ^) ^ 
asinh(v / 2A) cosh(v^2A) 
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where T" denotes the hitting time of 1, starting from a e]0, 1[, of the corresponding 
Bessel process. Expanding the denominator we get 



Smh(ay2A) _ 1 -(2n+l-a)V2X _ e -(2n+l+a) v / 2A 

asinh( v / 2A) a - 



n=0 



Inverting the Laplace transform yields the density of the distribution of Tf (i? 3 ) 

1 oo 

—^= ^2{2n + 1 - a ) e -(^+l-af/(2t) _ ( 2n + 1 + a)e -(2n + l+a)V(2t) 



Taking the Mellin transform we get 

£[(Tf (i? 3 )) s/2 ] = ^ (c(*, ^) - C(«, ^)) (»(*) > i) 

where Cl 5 ?^) = S^=o( n + X )~ S ^ s ^ ne Hurwitz zeta function. This identity extends by 
analytic continuation to all s G C. A similar expression exists for T^(Ri). 

One can use the product expansion for sinh in order to give an approximation of 

C(s,u) - ((s, 1 - u)-u e]0, 1[. For it is easy to see that [^=1 (l + ^r) (l + f ) _1 is 
the Laplace transform of a probability distribution on [0, oo[, and that this probability 
distribution converges towards that of T®(R 3 ), in such a way that there is a result similar 
to Theorem |3[ 

The Hurwitz zeta function can be used to construct Dirichlet L-functions by lin- 
ear combinations. However, direct probabilistic interpretations of general Dirichlet L- 
functions, in the spirit of what we did in Section ^ do not seem to exist. More precisely, 
let x be a primitive character modulo N, and let 

+oo 

x {t) = Y, ™ € x(n)e-™ 2t (93) 

n=— oo 

where e = or 1 according to whether \ is even or odd, so x(— 1) = (— These 
functions satisfy the functional equation 

m - (m) (94) 

where t(x) is a Gauss sum. Taking a Mellin transform, this yields the analytic continu- 
ation and functional equation for the associated Dirichlet L-function 



n" 
n=i 
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namely 

A(s, x) = (-iyr( X )N- s A(l - s, x) (95) 

where 

A(s , x) = ;r ^r(i±f) L, W . 

See |17j or [TT|, §1.1] for the classical derivations of these results. For general real x-, there 



does not seem to be any simple probabilistic interpretation of x (t). In particular, this 
function is not necessarily positive for all t > 0. This can be seen as follows. We choose 
an odd character x (the case of even characters is similar), and compute the Laplace 
transform of 9 X using 



J „ J „ i 



n)e~- VnX 



Using the periodicity of x, this equals 

N'^nZl X{n)e~^ _ iV- 1/2 Ei=T 1)/2 x(^)sinh(^v^A) 



1 - e ~ 2y/7rX sinh(V7rA) 



(96) 



For small values of N, and x a rea l °dd character modulo N, one can see by inspection 
that this indeed is the Laplace transform of a positive function, hence by uniqueness 
of Laplace transform, 6 X is positive on the real line. However Polya |59j exhibited an 



infinite number of primes p such that for the quadratic character modulo p the polynomial 
Q p (x) := Y7n=i xn x{ n ) takes negative values somewhere on [0,1]. In particular, for 
p = 43 we find Q43(3/4) pa —0.0075. For such quadratic Dirichlet characters, the Laplace 
transform above also takes negative values, which implies that 9 X does not stay positive 
on [0, oof. We note that if 6 X > on ]0, oo[ then obviously its Mellin transform has no 
zero on the real line, and hence the corresponding L-function has no Siegel zeros. 

6.2 Other probabilistic aspects of Riemann's zeta function 

It is outside the scope of this paper, and beyond the competence of its authors, to discuss 
at length the theory of the Riemann zeta function. But we mention in this final Section 
some other works relating the zeta function to probability theory. 

The work of Polya has played a significant role in the proof of the Lee- Yang theorem 



in statistical mechanics: see the discussion in [pTT , pages 424-426]. Other connections 



between Riemann zeta function and statistical mechanics appear in Bost and Connes 



10 1 and in Knauf [40] 
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The Euler product for the Riemann zeta function is interpreted probabilistically in 
Golomb 



and Nanopoulos 48 



via the independence of various prime factors when 
choosing a positive integer according to the distribution with probability at n equal to 
-J ,,-s f or some s > 1. See also Chung [|13|, p. 247] and |I8] . 



n 



coo- 

It is an old idea of Denjoy |1| that the partial sums of the Mobius function should 
behave like a random walk (the law of iterated logarithm would imply Riemann hypoth- 
esis). 

Fascinating connections between zeros of the Riemann zeta function (and other L- 
functions) and eigenvalue distribution of random matrices are currently under intense 
scrutiny. See Odlyzko p(| and Katz and Sarnak |34 . 



Finally let us mention a few other recent references where the Riemann zeta function 
makes some appearances: Asmussen, Glynn and Pitman ||, Joshi and Chakraborty |3"2" 
Chang and Peres (12 
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